Investigating the effects of the fixed and varying dispersion parameters of Poisson-gamma models on empirical Bayes estimates.
نویسندگان
چکیده
Traditionally, transportation safety analysts have used the empirical Bayes (EB) method to improve the estimate of the long-term mean of individual sites; to correct for the regression-to-the-mean (RTM) bias in before-after studies; and to identify hotspots or high risk locations. The EB method combines two different sources of information: (1) the expected number of crashes estimated via crash prediction models, and (2) the observed number of crashes at individual sites. Crash prediction models have traditionally been estimated using a negative binomial (NB) (or Poisson-gamma) modeling framework due to the over-dispersion commonly found in crash data. A weight factor is used to assign the relative influence of each source of information on the EB estimate. This factor is estimated using the mean and variance functions of the NB model. With recent trends that illustrated the dispersion parameter to be dependent upon the covariates of NB models, especially for traffic flow-only models, as well as varying as a function of different time-periods, there is a need to determine how these models may affect EB estimates. The objectives of this study are to examine how commonly used functional forms as well as fixed and time-varying dispersion parameters affect the EB estimates. To accomplish the study objectives, several traffic flow-only crash prediction models were estimated using a sample of rural three-legged intersections located in California. Two types of aggregated and time-specific models were produced: (1) the traditional NB model with a fixed dispersion parameter and (2) the generalized NB model (GNB) with a time-varying dispersion parameter, which is also dependent upon the covariates of the model. Several statistical methods were used to compare the fitting performance of the various functional forms. The results of the study show that the selection of the functional form of NB models has an important effect on EB estimates both in terms of estimated values, weight factors, and dispersion parameters. Time-specific models with a varying dispersion parameter provide better statistical performance in terms of goodness-of-fit (GOF) than aggregated multi-year models. Furthermore, the identification of hazardous sites, using the EB method, can be significantly affected when a GNB model with a time-varying dispersion parameter is used. Thus, erroneously selecting a functional form may lead to select the wrong sites for treatment. The study concludes that transportation safety analysts should not automatically use an existing functional form for modeling motor vehicle crashes without conducting rigorous analyses to estimate the most appropriate functional form linking crashes with traffic flow.
منابع مشابه
تنظیم نقشه خطر نسبی مرگ کودکان زیر یکسال مناطق روستایی کشور در سال 1380 و 1385: مقایسه روشهای حداکثر درستنمایی و بیزی
Background and objective: Disease or mortality mapping are statistical methods aimed at providing precise estimates of rates across geographical maps. The aim of this research is to improve the precision of relative risk (RR) estimates of infant mortality (IM) for different rural areas, using empirical and full Bayesian methods. Methods: Infant mortality data were extracted from the vital ...
متن کاملEffects of the Varying Dispersion Parameter of Poisson-gamma models on the estimation of Confidence Intervals of Crash Prediction models
The most common probabilistic structure of the models used by transportation safety analysts for modeling motor vehicle crashes are the traditional Poisson and Poissongamma (or Negative Binomial) distributions. Since crash data have been shown to exhibit over-dispersion, Poisson-gamma models are usually preferred over Poisson regression models. Up until recently, the dispersion parameter of Poi...
متن کاملApproximating Bayes Estimates by Means of the Tierney Kadane, Importance Sampling and Metropolis-Hastings within Gibbs Methods in the Poisson-Exponential Distribution: A Comparative Study
Here, we work on the problem of point estimation of the parameters of the Poisson-exponential distribution through the Bayesian and maximum likelihood methods based on complete samples. The point Bayes estimates under the symmetric squared error loss (SEL) function are approximated using three methods, namely the Tierney Kadane approximation method, the importance sampling method and the Metrop...
متن کاملمدل سازی بیزی حوادث ناشی از کار ایران در سال 1388
Introduction: It is of prime importance to consider the pattern and geographical changes of a disease, in each community independently, to determine high and low risk areas. Mapping diseases is a set of statistical methods which attempt to provide precise maps by which the geographical distribution of a disease is estimated. In this study, Bayesian methods were applied to estimate the relati...
متن کاملبه کارگیری بیز تجربی در تهیه نقشه جغرافیایی بروز بیماری سل در استان مازندران طی سالهای 90-1384
Background and purpose: Due to the increasing information about illnesses and deaths, classified map is of appropriate methods for analyzing this type of data. Standardized infection rates are commonly used in disease mapping but had many defects. This study aimed to compare the Poisson regression models and empirical Bayes models to prepare geographical map of tuberculosis incidence in Mazanda...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Accident; analysis and prevention
دوره 40 4 شماره
صفحات -
تاریخ انتشار 2008